AITopics | computer control

computer control

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants

Sager, Pascal J., Meyer, Benjamin, Yan, Peng, von Wartburg-Kottler, Rebekka, Etaiwi, Layan, Enayati, Aref, Nobel, Gabriel, Abdulkadir, Ahmed, Grewe, Benjamin F., Stadelmann, Thilo

arXiv.org Artificial Intelligence | Jan-27-2025

Instruction-based computer control agents (CCAs) execute complex action sequences on personal computers or mobile devices to fulfill tasks using the same graphical user interfaces as a human user would, given instructions in natural language. This review offers a comprehensive overview of the emerging field of instruction-based computer control, examining available agents -- their taxonomy, development, and respective resources -- and emphasizing the shift from manually designed, specialized agents to leveraging foundation models such as large language models (LLMs) and vision-language models (VLMs). We formalize the problem and establish a taxonomy of the field to analyze agents from three perspectives: (a) the environment perspective, analyzing computer environments; (b) the interaction perspective, describing observation spaces (e.g., screenshots, HTML) and action spaces (e.g., mouse and keyboard actions, executable code); and (c) the agent perspective, focusing on the core principle of how an agent acts and learns to act. Our framework encompasses both specialized and foundation agents, facilitating their comparative analysis and revealing how prior solutions in specialized agents, such as an environment learning step, can guide the development of more capable foundation agents. Additionally, we review current CCA datasets and CCA evaluation methods and outline the challenges to deploying such agents in a productive setting. In total, we review and classify 86 CCAs and 33 related datasets. By highlighting trends, limitations, and future research directions, this work presents a comprehensive foundation to obtain a broad understanding of the field and push its future development.
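
The three perspectives named in the abstract (environment, interaction, agent) can be illustrated with a small interaction loop. The following is a minimal sketch and not the formalization from the paper: every name in it (Observation, Click, TypeText, ToyFormEnv, ScriptedAgent, run_episode) is hypothetical, the environment is a toy one-field form, and the agent is a hand-written script standing in for an LLM or VLM policy.

```python
from dataclasses import dataclass
from typing import Protocol, Tuple, Union


@dataclass(frozen=True)
class Observation:
    """Interaction perspective: what the agent perceives (here only HTML)."""
    html: str = ""
    screenshot: bytes = b""  # unused in this toy example


@dataclass(frozen=True)
class Click:
    """Mouse action at pixel coordinates."""
    x: int
    y: int


@dataclass(frozen=True)
class TypeText:
    """Keyboard action that types a string."""
    text: str


Action = Union[Click, TypeText]


class Environment(Protocol):
    """Environment perspective: the computer environment the agent acts in."""
    def reset(self, instruction: str) -> Observation: ...
    def step(self, action: Action) -> Tuple[Observation, bool]: ...


class Agent(Protocol):
    """Agent perspective: maps instruction and observation to an action."""
    def act(self, instruction: str, observation: Observation) -> Action: ...


class ToyFormEnv:
    """Hypothetical one-field form; done once the field holds the target text."""

    def __init__(self, target: str) -> None:
        self.target = target
        self.focused = False
        self.value = ""

    def _observe(self) -> Observation:
        focus = " data-focused" if self.focused else ""
        return Observation(html=f'<input id="name" value="{self.value}"{focus}>')

    def reset(self, instruction: str) -> Observation:
        self.focused, self.value = False, ""
        return self._observe()

    def step(self, action: Action) -> Tuple[Observation, bool]:
        if isinstance(action, Click):
            # Assumed layout: the field occupies 0 <= x < 100, 0 <= y < 20.
            self.focused = 0 <= action.x < 100 and 0 <= action.y < 20
        elif isinstance(action, TypeText) and self.focused:
            self.value += action.text
        return self._observe(), self.value == self.target


class ScriptedAgent:
    """Stand-in for a learned policy: focus the field, then type the text."""

    def __init__(self, text: str) -> None:
        self.text = text

    def act(self, instruction: str, observation: Observation) -> Action:
        if "data-focused" not in observation.html:
            return Click(x=10, y=10)
        return TypeText(self.text)


def run_episode(env: Environment, agent: Agent, instruction: str,
                max_steps: int = 10) -> bool:
    """Observe, act, and repeat until the task is done or steps run out."""
    obs = env.reset(instruction)
    for _ in range(max_steps):
        obs, done = env.step(agent.act(instruction, obs))
        if done:
            return True
    return False
```

Calling run_episode(ToyFormEnv("hello"), ScriptedAgent("hello"), "Type hello into the name field") completes in two steps (one click, one typing action). Replacing ScriptedAgent with a model-backed policy would leave the loop itself unchanged, which is the point of separating the three perspectives.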

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2501.1615

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)
North America > Canada > Quebec > Montreal (0.04)
(5 more...)

Genre:

Workflow (1.00)
Research Report (1.00)
Overview (1.00)

Industry: Information Technology > Security & Privacy (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.93)

A Zero-Shot Language Agent for Computer Control with Structured Reflection

Li, Tao, Li, Gang, Deng, Zhiwei, Wang, Bryan, Li, Yang

arXiv.org Artificial Intelligence | Oct-23-2023

Large language models (LLMs) have shown increasing capacity at planning and executing a high-level goal in a live computer environment (e.g. MiniWoB++). To perform a task, recent works often require a model to learn from trace examples of the task via either supervised learning or few/many-shot prompting. Without these trace examples, it remains a challenge how an agent can autonomously learn and improve its control on a computer, which limits the ability of an agent to perform a new task. We approach this problem with a zero-shot agent that requires no given expert traces. Our agent plans for executable actions on a partially observed environment, and iteratively progresses a task by identifying and learning from its mistakes via self-reflection and structured thought management. On the easy tasks of MiniWoB++, we show that our zero-shot agent often outperforms recent SoTAs, with more efficient reasoning. For tasks with more complexity, our reflective agent performs on par with prior best models, even though previous works had the advantages of accessing expert traces or additional screen information.
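
The idea of progressing a task by identifying and learning from mistakes across trials, without expert traces, can be sketched as a retry loop that carries a structured memory of rejected actions. This is an illustrative simplification and not the authors' implementation: the names propose_plan, execute, and solve_with_reflection are hypothetical, the planner is a deterministic stand-in for an LLM, and the toy environment reports the first wrong step directly, whereas the agent in the paper has to infer its mistake through self-reflection.

```python
from typing import Dict, List, Optional, Set, Tuple

Plan = List[str]
# Structured reflection memory: step index -> actions ruled out at that step.
Reflections = Dict[int, Set[str]]


def propose_plan(candidates: List[List[str]], reflections: Reflections) -> Plan:
    """Stand-in planner: per step, pick the first action not yet ruled out."""
    plan: Plan = []
    for step, options in enumerate(candidates):
        banned = reflections.get(step, set())
        allowed = [action for action in options if action not in banned]
        if not allowed:
            raise RuntimeError(f"no untried action left at step {step}")
        plan.append(allowed[0])
    return plan


def execute(plan: Plan, solution: Plan) -> Tuple[bool, int]:
    """Toy environment: (success, index of the first wrong step or -1)."""
    for step, (got, want) in enumerate(zip(plan, solution)):
        if got != want:
            return False, step
    return True, -1


def solve_with_reflection(candidates: List[List[str]], solution: Plan,
                          max_trials: int = 5) -> Tuple[Optional[Plan], int]:
    """Retry the task, recording each identified mistake before replanning."""
    reflections: Reflections = {}
    for trial in range(1, max_trials + 1):
        plan = propose_plan(candidates, reflections)
        ok, failed_step = execute(plan, solution)
        if ok:
            return plan, trial
        # Reflection: remember the mistaken action so it is not repeated.
        reflections.setdefault(failed_step, set()).add(plan[failed_step])
    return None, max_trials


if __name__ == "__main__":
    candidates = [
        ["click cancel", "click name field"],
        ["type 'Ada'"],
        ["click reset", "click submit"],
    ]
    solution = ["click name field", "type 'Ada'", "click submit"]
    print(solve_with_reflection(candidates, solution))
```

In this example the first two trials fail at steps 0 and 2 respectively, and the third trial succeeds because both mistaken actions have been excluded. Keying the memory by step index is one possible way to keep reflections structured; free-text reflections fed back into a prompt would be another.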

agent, llm, reflection, (14 more...)

arXiv.org Artificial Intelligence

2310.0874

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States (0.04)

Genre: Research Report > Experimental Study (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Why we need human-centered AI

#artificialintelligence | Apr-11-2022, 22:05:45 GMT

Welcome to AI book reviews, a series of posts that explore the latest literature on artificial intelligence. There are two contrasting but equally disturbing images of artificial intelligence. One warns about a future in which runaway intelligence becomes smarter than humanity, creates mass unemployment, and enslaves humans in a Matrix-like world or destroys them a la Skynet. A more contemporary image is one in which dumb AI algorithms are entrusted with sensitive decisions that can cause severe harm when they do go wrong. What both visions have in common is the absence of human control.

application, human control, shneiderman, (14 more...)

#artificialintelligence

Country: North America > United States > Maryland (0.05)

Industry:

Banking & Finance (0.51)
Health & Medicine (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Why we need human-centered AI

#artificialintelligence | Apr-5-2022, 10:50:54 GMT

application, human control, shneiderman, (14 more...)

#artificialintelligence

Country: North America > United States > Maryland (0.05)

Industry:

Banking & Finance (0.51)
Health & Medicine (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

The case for human-centered AI

#artificialintelligence | Mar-22-2022, 00:41:14 GMT

application, human control, shneiderman, (14 more...)

#artificialintelligence

Country: North America > United States > Maryland (0.05)

Industry:

Banking & Finance (0.51)
Health & Medicine (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

DeepMind Trains Agents to Control Computers as Humans Do to Solve Everyday Tasks

#artificialintelligence | Feb-22-2022, 15:26:51 GMT

While the design and development of contemporary AI systems has been largely results-oriented, there are also scenarios where it could be advantageous if models learned to do things "as a human would" to help with everyday tasks. That's the premise of the new DeepMind paper A Data-driven Approach for Learning To Control Computers, which proposes agents that can operate our digital devices via keyboard and mouse with goals specified in natural language. The study builds on recent developments in natural language processing, code production, and multimodal interactive behaviour in 3D simulated worlds that have enabled the generation of models with remarkable domain knowledge and desirable human-agent interaction capabilities. The proposed agents are trained on keyboard and mouse computer control for specific tasks with pixel and Document Object Model (DOM) observations, and achieve state-of-the-art and human-level mean performance across all tasks on the MiniWob benchmark. MiniWob is a challenging suite of web-browser-based tasks for computer control, ranging from simple button clicking to complex form filling.

agent, control computer, deepmind train agent, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.76)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)

The Self-Driving Car Is a Red Herring - Issue 92: Frontiers

Nautilus | Oct-23-2020, 04:30:03 GMT

Ten years ago this fall, Google gave us a glimpse of a new device unlike any it had ever built before--a computer-controlled car. It seemed such a strange thing for an Internet company to spend its time and energy on, a "moonshot" as the company's engineers called such massive efforts. But with a single blog post, the search giant promised to reinvent our cars, and our communities, too. It was a big vision for a single invention to carry. And the details were scant. But we quickly filled in the blanks. Software was going to replace our dangerous, congested, sprawling roads with something utterly safe, seamless and organized. Humans would take the back seat in a new network of "ghost roads," as I call them.

artificial intelligence, flexible density, ghost road, (14 more...)

Nautilus

Country:

North America > United States > California (0.15)
North America > United States > New York (0.06)
North America > United States > Virginia (0.04)
(4 more...)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Receptive Environments for Artificial Intelligence

#artificialintelligence | Dec-2-2019, 03:41:53 GMT

Related to the need for matching the corporate culture is the necessity for having a receptive environment for a project based on AI technology. In addition to the obvious importance of management support, developers and intended users must also become enthusiastic, or at least not antagonistic. During a tour of a heavy manufacturing facility, an AI development group was studying the feasibility of incorporating knowledge systems into the factory. At one point, they came across an old-time machinist who was operating a large metal-cutting machine equipped with extensive computer controls. However, the machinist was not using the computer controls at all; he even ignored the LCD display.

artificial intelligence, knowledge system, receptive environment, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (1.00)

Robots could soon walk like HUMANS

AITopics Original Links | Jan-18-2017, 10:32:56 GMT

Footage of robots falling over recently caused hilarity on social media. While we may be impressed by their artificial intelligence, humanoids often have an awkward, stumbling gait. Now scientists have developed a new system that they say will allow future robots to walk in the same way as humans, and avoid being knocked over easily. The technology could allow robots to one day take over human jobs, such as serving in the armed forces or doing household chores, the researchers claim.

artificial intelligence, locomotion, robot, (15 more...)

AITopics Original Links

Country:

North America > United States > Oregon (0.06)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Creepy RatCar could drive mobility research

AITopics Original Links | Jan-18-2017, 10:08:36 GMT

The cyborg armies of the future just got one step closer to total domination. Probably by taking a break from building giant fighting robots, scientists at the University of Tokyo have created the RatCar, a wheeled contraption controlled by a rat's brain. The researchers wanted to prove a simple idea right: that animals could use the parts of their brains that control limbs to control a vehicle. It looks like they can. The goal of the research was to see if it might eventually be feasible for paralyzed people to control wheelchairs using brain implants.

artificial intelligence, drive mobility research, ratcar, (6 more...)

AITopics Original Links

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.26)
South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.06)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.41)

Technology: Information Technology > Artificial Intelligence (0.41)
